首页> 外文OA文献 >Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem
【2h】

Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem

机译:调整大数据的小分析:Hadoop生态系统中的数据分区和二级索引

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In the recent years the problems of using generic storage (i.e., relational) techniques for very specific applications have been detected and outlined and, as a consequence, some alternatives to Relational DBMSs (e.g., HBase) have bloomed. Most of these alternatives sit on the cloud and benefit from cloud computing, which is nowadays a reality that helps us to save money by eliminating the hardware as well as software fixed costs and just pay per use. On top of this, specific querying frameworks to exploit the brute force in the cloud (e.g., MapReduce) have also been devised. The question arising next tries to clear out if this (rather naive) exploitation of the cloud is an alternative to tuning DBMSs or it still makes sense to consider other options when retrieving data from these settings.; In this paper, we study the feasibility of solving OLAP queries with Hadoop (the Apache project implementing MapReduce) while benefiting from secondary indexes and partitioning in HBase. Our main contribution is the comparison of different access plans and the definition of criteria (i.e., cost estimation) to choose among them in terms of consumed resources (namely CPU, bandwidth and I/O).
机译:近年来,已经发现并概述了将通用存储(即,关系)技术用于非常特定的应用的问题,结果,出现了关系DBMS(例如,HBase)的一些替代方案。这些替代方案中的大多数都位于云上并从云计算中受益,如今,这是一个现实,可以帮助我们通过消除硬件和软件固定成本以及按使用付费的方式来节省资金。最重要的是,还设计了利用云中蛮力的特定查询框架(例如MapReduce)。接下来出现的问题试图弄清楚这种对云的利用(而不是幼稚的)是替代DBMS的替代方法,还是从这些设置中检索数据时考虑其他选项仍然有意义。在本文中,我们研究了利用Hadoop(实施MapReduce的Apache项目)解决OLAP查询的可行性,同时受益于HBase中的二级索引和分区。我们的主要贡献是比较了不同的访问计划和标准的定义(即成本估算),以便根据消耗的资源(即CPU,带宽和I / O)进行选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号